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1 The Main Result 

Let r(k, N) be the maximal cardinality of a subset A of {1, 2, ... , N} which does not contain 
an arithmetic progression of length k. That is, A does not contain a subset of the form 
{x + jy : < j < k}, where x, y are integers with y ^ 0. 

Erdos and Turan || initiated the study of these quantities in 1936. In particular they 
conjectured that r(k, N) = o(N) for all k, that is every set of integers of positive asymptotic 
density contains arbitrarily long arithmetic progressions. In 1953, Roth |§ showed that 



r(3, N) = o(N). The Erdos-Turan conjecture was verified by Szemeredi ||TT], 0, a result 
with a very broad influence. Subsequently, rather different proofs of Szemeredi's theorem 
were given by Furstenberg M and Gowers [|5], E|. Gowers's proof provides, for the first time, 
upper bounds on r(k, N) given by a bounded tower of exponentials. An intriguing question 
of Erdos asks if r(3, N) < C N / (log N) 1+s for some positive 5. Bourgain's article contains 
the best current upper bound of CN^J^f^^ on r(3, N) . 

In this article we are interested in the converse question of finding large subsets of 
{1,...,N} which do not contain arithmetic progressions. Behrend, in 1946, |IJ (building 
on earlier work of Salem and Spencer [|10]) considered three term arithmetic progressions, 
and showed that r(3, N) > N exp(— CyTog N). The purpose of this paper is to show that 
if one considers longer arithmetic progressions then Behrend's estimate can be further im- 
proved as follows. 
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Theorem 1 There is a constant C > so that for all n > k > 1, 

r{l + 2 k ,N) > iVexp(-C(logiV) 1/(fc+1) ). (1.1) 

2 The Proof 

Our argument builds upon the methods of Salem and Spencer |H| and of Behrend jlj. It 
will be convenient to consider the set / = Zfl (— ^y^, ^y^] instead of {1,2,..., N}. First, we 
may assume that N = n d for suitably chosen integers n and d, with n much smaller than N 
and divisible by a constant c (independent of N, n) to be chosen later. Indeed, at the cost 
of a slightly larger constant in our theorems we can always increase n to one of these values. 
Similarly, we shall take fractional powers and logarithms of large integers and tacitly assume 
that the output is also an integer. In fact the argument requires the integer parts of these 
quantities, but to minimize notation we do not explicitly invoke the integer part function. 

Second, with N = n d , consider the expansion of each x G / in base n, defined as follows. 
For any x G / we define its coordinate vector v x = (x , . . . , Xa-i) G Z d , where x« are uniquely 
determined by the conditions 

. n - 1 ra-l . . 

x = 2^ Xin 1 , — < Xi < ——. (2.2) 

Note that, unlike in Behrend's argument, the "digits" X{ are not required to be non-negative. 
Denote also the "norm" of x G / as 

d-l 

II ||2 _ || ||2 _ \ " 2 
|| 3^ || 1 1 ^xH / j Xj , 

i=0 

with Xi defined by ( |2 . 2|) . 

An important observation of Salem and Spencer |10 was that if we only consider the set 



Qo of numbers x G {0, 1, . . . , N — 1} with digits < xi < cn, where c is sufficiently smallQ, 
then addition of numbers is equivalent to vector addition in the corresponding subset of Z d , 
i.e. for any x,y,z G Qo we have x + y — z if and only if v x + v y — v z . Thus an arithmetic 
progression x, x + y, x + 2y, . . . in Q corresponds to vectors v x , v x+y , v x+ 2 y , ... on a straight 
line in Z d . 

We shall rely on variants of this observation. More precisely, we define 

Q = {i 6 Z : X = 5Z x i n% i ~ 1 — x i — (2-3) 

i=0 



1 Salem and Spencer considered expansions with non- negative digits < Xi < n — 1, in which case it 
suffices to take c = 1 /2 
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where q = n/c and c is a large constant independent of N, n to be chosen later. We will 
also denote for r € Z: 

d-l 

rQ = {x G Z : x = x^n*, — rq < Xi < rq}. 

i=0 

Then linear combinations of numbers in rQ with small enough integer coefficients correspond 
to linear combinations of their coordinate vectors: 

v J2 a k* (k) = J2 a k v x^ if a k ,r k G Z, x (k) G r k Q, ^r k \a k \ < c /3. (2.4) 

Our proof consists of two distinct parts, both similar in spirit to Behrend's argument [0. 
The latter relies on the geometrical fact that a straight line can intersect a sphere ||fx|| 2 =r 
in Z d in at most two points, so that the set {x £ Q : \\x\\ 2 = r} cannot contain a three-term 
arithmetic progression. One then uses pigeonholing to choose a sphere containing a large 
number of points in Q. 

Our intermediate results can be stated in terms of quantities closely related to those 
of Erdos and Turan. Namely, define r m (k, N) to be the maximal cardinality of a subset 
Ac {0, 1, . . . , N — 1} which does not contain a further subset of the form 

m 

{x + Y^dif : 0<j<fc-l}, (2.5) 
i=i 

for any integers x and dj such that at least one of the a, is non-zero. (In particular, ri(k, N) = 
r(k,N) and r m (k,n) decreases with m.) Observe that a set of the form (|2.5|) with u > 2 
may contain less than k distinct integers, as the same summand may arise from more than 
one value of j. Note further that the eij need not belong to A. Finally, while this is defined 
as a property of the initial interval of integers {0, . . . ,N — 1}, it depends only on the length 
of the interval of integers in question. 

The estimates we will need are the following. 

Proposition 2 We have 

r m (2m + l,N) > N exp(-C ifiogN), (2.6) 
where C is an absolute constant depending only on m. 

Proposition 3 Assume that N = n d , and let k > m + 1. Then 

r m( k,N)>N r -^>, (2.7) 
where the constant c > depends only on m and k. 
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Proposition ^| is proved by essentially repeating Behrend's argument with straight lines 
replaced by curves of higher order; the main point is that a non-constant polynomial of 
degree 2m can have at most 2m roots. Proposition [5] will allow us to carry out the inductive 
argument. Instead of just one sphere as in Behrend's argument, the set A which provides 
the lower bound in ( |2.7|) will be a union of concentric spheres of radii \/r, r e R. We will 
argue that if A contains a subset {x^} as in (|2.5| ), then the squared norms ||x^|| 2 are as in 
( p.5|) with m replaced by 2m. Proposition ^| will follow upon choosing a set R of cardinality 
f2m{k, dn 2 ) which cannot contain such a subset, and optimizing over n and d. 

We will use C, c, q, etc. to denote absolute constants which may depend on m and may 
change from line to line but are always independent of N, n, d. 



3 Proof of Proposition g 

Our goal in this section is to find a set R C {0, 1, . . . , iV — 1} of large cardinality such that 
R does not contain all of the integers 

m 

»</ : J = 0,l,...,2m (3.8) 

i=0 

for any clq, . . . , a m G Z with 7^ for at least one i > 0. We will use the notation of Section 
0. In particular, we will replace the set {0, 1, . . . , N — 1} by /, and assume that N = n d for 
some 1 d iV and 1 < n <C (eventually we will let d ~ \/\ogN). The set R will be a 
subset of the set Q defined in ( |2.3j ) . 

Lemma 4 Suppose that 2m + 1 numbers x^' in Q satisfy 

m 

»./"• 3 =0,1,..., 2m, (3.9) 

i=0 

for some integers clq, . . . , a m . Denote by D the Vandermonde determinant D = D m = \J m \, 
where J m = (j l )™ =1 . Then there is a constant c, depending only on m, such that 

Dai ecQ, i = 0,...,m. (3.10) 

Furthermore, if the constant Co in the definition of Q was chosen large enough, then we have 
for any such numbers 

m 

Dv x0) =Y,f v D^ J = 0,1,..., 2m. (3.11) 

8=0 
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Proof. We consider the first m + 1 equations in ( |3.9| ) as a system of linear equations with 
unknowns do, . . . , a m . By Cramer's formula, Da^ are linear combinations of x^' with integer 
coefficients bounded by a constant depending only on m. This implies (|3.10 ). Now fl3.11| ) 
follows from ( 01Q|) , (p|) and ((2 



We are now in a position to run Behrend's argument. Let 

S r = {x E Q : \\x\\ 2 = r}, 

where ||x|| 2 = \\v x 

II 2 = EioN 2 - We win 

prove that no S r may contain 2m + 1 points 

as in (|3.9|). Indeed, suppose to the contrary that x^\ j = 0,1, ... ,2m, satisfy ( |3.9| ) and 
11^,0) || 2 _ r _ gy L emma |j ; we have 



/, (I / — u 



-J 



But then -P(j) is a polynomial of degree 2m in j, equal to r for j = 0, 1, . . . , 2m. This is not 
possible unless P(j) is constant, in which case we must have (Da,i)k = for all < k < d — 1 
and all 1 < % < m. By Lemma (| again, it follows that cij = for all 1 < i < m. 

Finally, we use a pigeonholing argument to find a set S r of large cardinality. Following 
Behrend [|], we set d = ^\ogN and n = N^ d , so that q = JV^/IOOO. Since Q has 
cardinality {2q) d and Q = Ur=o $r, there is at least one r for which 

logA^s 



N-^Sr > (d500V) _1 > Ci exp(-C 1 d)iV- 2/c( > dexp (-C 1 (d + 



d 



Taking d = yToglV proves the proposition. 

4 Proof of Proposition [3] 

We continue to use the notation of Section ^ we assume that N = n d with n,d <^ N, and 
define g, Q, v x , \\x\\, D, etc. as before. We also define 

d-l 
i=0 

for x, y e yQ. 

Let i? C {0, 1, ... , D 2 dq 2 — 1} be a set of cardinality r 2m (/c, D 2 dq 2 ) which does not contain 
all of the integers 

2m 

y U)=J2a>if, j = 0,l,...,fc-l, (4.12) 

i=0 
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for any do, ... , ci2m £ Z. Observe that any translate i? + s := {r + s : r G -R}, s G Z, of i? has 
the same cardinality as R and cannot contain k integers as in ( |4.12| ). Let X = 2qY,iZo ri l G 
2Q and S := {0, 1, . . . , 9D 2 dq 2 }. For s G 5, define 

4 = {i£Q: D 2 ||x-X|| 2 G + 

We claim that no A s can contain k integers 

m 

x U) = Y / b lJ \ j = 0,1,..., A; -1. (4.13) 

i=0 

Indeed, suppose to the contrary that A s does contains such k integers. As in Lemma f|, we 
prove that 

m 

Dv xU) _ x = J2f v Db t - Dv x , j = 0, . . . , k - 1, 

i=0 

provided that cq was chosen large enough. Hence 

d— 1 m „ 

D 2 \\x® -Xf = E(Ej'»-2^) 

fc=0 i=0 

are as in (|4.12|) . But this is impossible by the choice of R. 

A pigeonholing argument shows that there is an A s with large cardinality. For any x G Q 
we have q < (X — x)i < 3q for each i, hence D 2 dq 2 < D 2 \\x — X\\ 2 < 9D 2 dq 2 . Hence for any 
x G Q and r G R we have 

1 < \\x-Xf -r < 9D 2 dq 2 } 

and in particular there is a s G S such that D 2 ||a; — A|| 2 = r + s. It follows that for each 
x £ Q there are at least #i? values of s such that i64 Hence 

E #A > #i? ■ #g. 

In particular, there is an s G S 1 such that 

* s ~ #5 " 1000 d ' D 2 n 2 d ' 

which yields ( |2.7|) . 
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5 Proof of Theorem ft] 

We will prove that for all 1 < k <C logiV and all 1 < I < k, 

r 2k -i(l + 2 k , N) > iVexp ( - c(logiV)^). (5.14) 
In particular, taking I = k we obtain (|1.1|). Here and below, the constants c,c',c" may 



depend on m, k, I, but not on N. 

The proof of ( |5.14| ) is by induction in /. The case I = 1 is Q2.6|). Suppose now that Q5.14| ) 
holds for I, and set N = n d , d~ (log jV)V0+2). Then by (fT7|) we have 

r 2k -i-i(l + 2 fe , iV) > N r _2^±^2^1 > iv c - d exp ( - c'(log(n 2 d))TTi) 

c d n 2 d 

> iVexp( - c"(logA^)^2), 

which is ( |5"T4D for / + 1. 
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